Quantum correlations in predictive processes 
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We consider tiie role of quantum correlations in the efficient use of information by a predictive 
quantum system, generalizing a recently proposed classical measure of non-predictive information 
to the quantum regime. We show that, as a quantum system changes state, the non- predictive 
information held by another correlated quantum system is exactly equal to the extractable work 
that is lost from the second system. We use quantum discord to quantify the quantum contribution, 
and demonstrate the possibility of improved thermodynamic efficiency due to a negative "quantum 
part" of the lost work. We also give a thermodynamic interpretation to quantum discord, as the 
reduction in extractable work under an optimal classical approximation of one of the systems. 



A system which by interacting with its environment 
tries to predict the future of some part of its surround- 
ings is essentially involved in the process of learning. In 
a realistic situation, the changing environment must be 
modeled as a stochastic process. From an information 
theory point of view, it may be thought of as a source 
emitting a random message, x, with probability Px{t). 
The system changes its state in response to this message 
and can be thought to perform an implicit computation 
of the environment's variables the states of the system 
and environment become correlated. As the evolution of 
the environment exhibits temporal correlations, the sys- 
tem becomes correlated with the past of its surroundings 
and also, implicitly, with their future. From a practical 
perspective, the most useful model is one that posseses 
predictive power without being unnecessarily complex — 
as much retained information as possible should be useful 
for predicting the future of the surroundings [ll, 0] • In a 
recent publication. Still et al. [l[ showed that an informa- 
tion theoretic measure of the inefficiency of the predictive 
process (the non-predictive information) is, under certain 
conditions, equal to thermodynamic inefficiency, i.e., the 
dissipated work during the evolution. The result shows a 
deep connection between effective use of information and 
efficient thermodynamic operation. 

Their discussion was limited to the case where both 
system and environment are classical. In this paper we 
make an extension to quantum parties. As motivation we 
note, first, that real environments are quantum, so quan- 
tum effects must come into play for microscopic systems 
at some stage. Second, we might ask whether there can 
be a non-classical advantage in the effective use of infor- 
mation, and thus in thermodynamic efficiency. 

Considering systems that are quantum, classical states 
must be replaced by quantum states — for uncertain x, by 
mixed states p(t) = Y^x Pxi't)\^xi''^)) i'^x{t)\; importantly, 
quantum states are not distinguishable in general. Fur- 
thermore, quantum systems can be correlated in ways 
that are inaccessible to a classical stochastic processes, 
e.g., through non-local entanglement It is natural 
then to ask what type of correlations might constitute a 



resource for efficient operation in the quantum case. 

We show that a relation between lost work and predic- 
tive inefficiency holds also for quantum systems, in anal- 
ogy with the classical case. Since the shared information 
may now involve quantum correlations, we nonetheless 
find different behavior for the predictive power and pro- 
cess complexity. To quantify any quantum advantage, 
we adopt quantum discord [J] as a measure of quantum 
correlations. Interestingly, we find that entanglement, 
although potentially providing greater ability to make 
predictions, cannot improve the efficiency of the process, 
as the improved predictive power necessarily comes at 
the price of greater process complexity (more informa- 
tion about the environment must be retained). We will, 
however, see that coherences in the system that is change 
by its environment can improve predictive efhciency, to 
a degree quantified by a negative (quantum) part of the 
lost work. This result assigns an operational meaning to 
quantum discord: it is the thermodynamic inefficiency of 
the most energetically efficient classical approximation of 
a quantum memory. 

Central to our results is the extension of Landauer's 
principle to quantum systems recently made by Rio et 
al. Landauer's principle states that the erasure of 
information from a memory must necessarily lead to the 
generation of heat, at minimum in proportion to its en- 
tropy 0. The converse of this statement, which will be of 
use to us, is that any memory not in a state of maximum 
entropy can in principle be used to perform work. In 
extending this to quantum systems, one must take into 
account any "quantum side information" held about the 
system in its environment Q ■ 

Let S' be a quantum system, and denote by X some 
part of the surrounding environment that is correlated to 
S; we assume that 5* is finite and consists of TV qubits. 
The setup might represent some physical system respond- 
ing to a changing environment whose state is encoded in 
X . From a thermodynamic point of view, we will view 
information held in S" as a potential source of work. 

Consider now that X undergoes an evolution, px — > 
p'y = £{px), with £ a quantum channel. Meanwhile, in- 
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formation about X is held by S, and some of this infor- 
mation is useful for making predictions about the evolu- 
tion. The two systems are represented by quantum state 
Psxj and the evolution of X can be written as a local 
map: psx p'sx = h® £{psx), where Uspsx = Px, 
tT^ sp'sx ~ P'x' ^n*^ is the identity on S. Different psx, 
constrained by tr spsx = Px, represent different mod- 
els encoding information about X. If we are interested 
in making predictions about the future state, a model 
might contain redundant information at the level of ac- 
curacy required. Wc can quantify information about the 
past state of X, which we refer to as the memory, by 
the mutual information I{S : X) := H{ps) - H{S\X), 
where H{p) = — tr[plog2/3] is the von-Neumann entropy 
measured in bits, H{S\X) = H{psx) — H{px) is the 
conditional quantum entropy of S given X, and ps de- 
notes the reduced state of S. Similarly, the predictive 
power, the information we have about the future state, 
can be quantified by the mutual information I{S : X') := 
H{ps) - HiS\X'), where HiS\X') = H{p'sx) - H[p'^). 
If two models achieve the same predictive power, the one 
containing the least information about the past displays 
a more efficient use of information. 

Recently, Still et al. [if proposed that the difference 
I{S : X) — I{S : X') serves as a natural and meaning- 
ful measure of predictive inefficiency. We may view it as 
non-predictive information. Classically, the optimization 
of this quantity can be seen as a special case of the infor- 
mation bottleneck method, where I{S : X) is minimized 
at fixed I{S : X') H 0]. It was also shown in P, that 
for classical systems, this measure equals the work lost 
when one considers extractable work from S* at a certain 
inverse temperature /3 — l/ksT. We show that the same 
identity holds for quantum systems. 

As the state of X is changed, work is performed on 
the total system psx- We then ask: docs the change in 
X change the work recoverable from S? Assuming all 
states are energetically equivalent — i.e., a fully degener- 
ate Hamiltonian — it was shown in Q that one can extract 
an amount of work We^t[S\X] = ^-^[N - H{S\X)] In 2 
from 5, at inverse temperature /3, with the state of X 
kept intact Q . This holds in the asymptotic limit where 
work extraction is considered on a large number of copies 
of the system. We adopt this limit throughout this paper 
(see for a discussion). We then define a thermody- 
namic inefficiency for S, at inverse temperature /3, as the 
difference of Woxt[<S'|A'] and Woxt['S'|A''], i.e., the negative 
change in work that can be extracted from S keeping the 
surroundings intact: 

most [psx ^ p'sx] [^(^1^0 - ^(^1^)] In 2. (1) 

It follows that 

most [psx p'sx] = [HS : X) - I{S : X')] In 2, (2) 

which shows that, just as in the classical case, predictive 
inefficiency equals thermodynamic inefficiency. Note that 



W^iost is positive, as mutual information cannot increase 
over a local map on X (the data processing inequality, 
see for example [l^l) 

Equation ([T|) involves quantum correlations. It is thus 
not surprising that quantum effects enter into the effec- 
tive use of information and thermodynamic efficiency as 
we define them. Quantum discord [3| makes this explicit. 
It measures how shared quantum information deviates 
from classical mutual information. For a definition we 
first introduce a "semiclassical" conditional entropy — a 
measure of the uncertainty left about 5* when a rank 
one projective measurement is performed on X (or vice 
versa). Denoting the measurement by {Pk}, with k an 
outcome, pk = Is ® PkPsxIs ® Pk/Pk is the system state 
conditioned on fc, withp^ = tT:[Is® PkPsx] the outcome 
probability. Now define H{S\X = k) := H{pk) as the 
conditional entropy of S given outcome k measured on 
X, and introduce H{S\X^) := ininJ2kPkH{S\X = k) 
as a conditional entropy, where the minimum is taken 
over all rank one projective measurements. This quan- 
tity may be seen as an alternative to the usual quantum 
conditional entropy, H[S\X), generalized from the con- 
ditional Shannon entropy for two classical random vari- 
ables Quantum discord is defined as the (positive) 
difference 5{S\X) := H{S\X(^) - H{S\X). 

Wc use 5{S\X) to quantify how quantum correlations 
enter into the thermodynamic efficiency of our model, 
as given by Eq. ([T|); conversely, through Eq. ([TJ, we 
assign a thermodynamic interpretation to quantum dis- 
cord. Note, first, that approximating X as classical can 
be viewed as sending it down a decoherence channel, 
Psx P^x = Y^k^S ® PkPsxIs ® Pk- Consider now 
the channel that minimizes the work lost according to 
Eq. ID). From the definition of quantum discord, 

min /Jl^iost [psx p'^x] = KS\X) In 2, (3) 

where the minimum is taken over all decoherence chan- 
nels acting locally on X. In other words, quantum dis- 
cord gives the energetic inefficiency, measured as lost 
work potential, of the most energetically efficient clas- 
sical approximation of X, when viewing the information 
held by S' as a source of work. It is also then equal to the 
non-predictive information in this decoherence process. 
See |12h16I | for other interpretations of quantum discord. 

Considering mutual information written as I{S : X) = 
H{ps) — H{S\X), we introduce as a natural measure of 
classical correlations, J'='(S'| a:) := H{ps)-H{S\X'=') 
We then divide the total correlations into a "classical" 
and a "quantum" part: I{S : X) = {S\X) + S{S\X) = 
I^{X\S) + 6iX\S), where we define I^{X\S) and S{X\S) 
with the projective measurement performed on S instead 
of AT. Of course, we can divide the predictive informa- 
tion, I{S : X'), into parts in exactly the same manner. 
Importantly, in general S{S\X) ^ 6{X\S), and either 
of these two quantities greater than zero signals non- 
classical correlations. 
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The lost work acquires classical and quantum parts 
from its dependence on I{S : X) and I{S : X') [Eq. 
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Wlost[pSX p'sx] = W^lc 



,,US\X) + W,l,iS\X) (4) 
t 

{X\S) + W,l,iX\S), 

where pW^^;^^ {S\X) := - I^^Q {S\X')]\ii2, 

and an analogous expression applies when the projective 
measurement is performed on S. 

In the quantum case one typically expects to have non- 
zero quantum discord both before and after the map on 
X [l8[. In other words, there is typically some quantum 
advantage in predictive power, I{S : X'), though, at the 
same time, a larger process complexity, I{S : X), due to 
quantum correlations. There is an advantage in thermo- 
dynamic efhciency only when Wj^^j < 0. This is poss ible 



because discord can be created by a local map [19|,|20j , an 



important contrast with entanglement. Indeed, the map 
Is (8) f , cannot increase entanglement [2lj, and the type 
of correlations generated here relate to orthogonal (dis- 
tinguishable) quantum states being mapped onto non- 
orthogonal (indistinguishable) states. The map cannot 
generate quantum correlations of the kind measured by 
5{X\S) either, as it acts locally on X Any quantum 
advantage must therefore be quantified by Wl^^^{S\X), 
and, in particular, no such advantage is possible for a 
system learning from a classical environment; although 
there might still be an advantage to the predictive power, 
as measured by 6{X'\S). 

We now apply these results to a situation where the 
state, psx, prior to the update on X results from a 
process where S and X, initially uncorrelated, interact 
through a common one-way reservoir. Potentially, the 
interaction can entangle and correlate the systems, and 
it is interesting to consider whether a quantum advan- 
tage to predictive efficiency can arise as information flows 
from X to 5*. This is a close quantum analog of the pro- 
cess considered in [ij. In this scheme (see Fig. [1]), the 
state of X is updated at discrete times to, ti, . . ., with 
Px{ti) — >■ p'xi^i) = ^i{px{ti))- In between these changes, 
information can be read from X through an interaction 
with a reservoir. For this we introduce the map TZx- The 
evolution of X is then written formally as • • • — > px (U) — >■ 

p'M = £^iPxit^)) ^ Px{U+l) = TZxip'xiU)) ■ ■ ■ ■ 

The quantum mechanical analogue of a "passive learn- 
ing" scenario [l| , where S may not change X (contrast an 
"interactive learning" scenario 2^), is then a situation 
where S retrieves information about p'x by interaction 
with the same reservoir in such a way that the evolution 
of X is unchanged. In summary, we consider an evolution 
for S plus X: 

Psx{U) p'sxi^t) = Is<^ £i{psx{U)), (5a) 

p'sxiU) ^ PsxiU+i) = TZsxip'sxiU)), (5b) 

where Is denotes the identity on 5* and trs[T^sx{p{t))] = 
■Rxipxit)). 
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FIG. 1: (Color online.) Schematic of the setup. System S 
gathers information about X through a common reservoir in- 
teraction (map TZsx), while X is driven from equilibrium by 
the maps £i. The evolution of pxif) is not modified by the 
presence of S. The changing of S due to the evolution of X 
is interpreted as an implicit computational model. 



Note that Eqs. ([5a|) and (|5bp imply a separation of time 
scales: the update on X happens much faster than any 
timescale associated with the reservoir interaction and 
response of S; the state of S is not changed during the 

update— ps(ii) = trx[psx(ii)] = ^^x[p'sxi^i)\- We may 
then, as in the classical case, view Eq. ((5a|) as a (fast) 
"work step" and Eq. (j5bp as a (slow) "relaxation step" . 

We note two further considerations. First, there is a 
departure from the assumption of the classical setup that 
the state of X not change during the relaxation In 
the quantum case the state of X can change. As a result, 
we do not consider the thermodynamic operation of the 
process as a whole, over a sequence of many time steps, 
but limit ourselves to the thermodynamic and predictive 
inefficiency at a single update. Our interest lies with the 
predictive capabilities of S at time t^, when the model en- 
coding the predictive information, psxiti), results from 
the evolution up to time ti. 

Second, it is important to note that we consider a part 
of W\ost{ti) ■■= Wiost[psx{ti) p'sxi'ti)] that arises from 
genuine quantum correlations, present before and after 
each update. This should not be confused with a sit- 
uation where X (or S) evolves according to a classical 
stochastic process. That would entail a state psx{t) = 
J2kPk{t)Pk{t) <8i -Pfc with pk{t) that stochastic process, 
where pfc(t) are density matrices for S, and Pk are rank 
one projectors for X. Importantly, the basis {Pk} is fixed 
for all times in this case, whereas the basis achieving the 
minimum in the definition of discord might be different 
at different times. 

Turning now to an illustration, we consider an exam- 
ple where quantum correlations contribute positively to 
the predictive efficiency. With both S and X taken to 
be qubits, we consider a local map on X shown in (20| to 
produce quantum correlations: £i{p) = KqpKq+KipKI, 
with Krauss operators if = |0)(0|x + -\/l — _p|l)(f|x and 
Ki ~ y^\0){l\x; and we set p = 0.7. We adopt the for- 
malism of cascaded quantum systems to describe the 
response of S. Under this scheme, qubit X couples to a 
zero temperature reservoir R at position z = 0, with 
interaction Hamiltonian Hxb = «\/2k((t^£'^(0) — H.c), 
while S couples to i? at z = /, with interaction Hsr = 
i^/2K{a^£^ (l) — H.c); 2k is the interaction rate for each 
qubit, and are Pauli operators, and £{z) is the 
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FIG. 2: (Color online.) Lost work in the implicit compu- 
tation at the discrete set of times uti; total lost work (blue 
solid bars, positive values), classically lost work (green shaded 
bars), quantum lost work (red solid bars, negative values). 
The inset shows the quantum part only. 



(photon) annihilation operator at z for the reservoir 
mediating the one-way coupling of X to S. The self- 
Hamiltonians of the qubits are set to zero, Hs = Hx = 0. 
We let ^ 0, for simplicity, and describe the system re- 
sponse through the master equation [23| {h = 1) 



p = ^i[H,p]+V[C]psx, 



(6) 



where H 



(cr^ - af), and C 



X 



the superoperator T) is defined through 'D[0]p = OpO^ — 
OtC)p/2 - pOtO/2. 

Starting from an uncorrelated initial state, psx{ta) = 

|o)(o|5 ® (l+X+lx + l-X-U) V2, |±) = (|0) ± |1))/A 

the evolution generates correlations in the system. We 
choose K(ti_|_i— ti) = 1, for all i, and calculate numerically 
Wiost{ti), Wi^^^{ti), and W^^^^iU), considering projective 
measurements on X, i.e., W^^^{ti) := W^^^ {S\X), un- 
der psx{ti) p'sxi^i)- The result appears in Fig.[5J As 
expected the total and classically lost work are positive, 
while the quantum part is negative for all ti > to. The 
efficiency is zero in the uncorrelated initial state, beyond 
which the relaxation process, Eq. ([5]), creates correlations 
and allow the model to be predictive; it may be thought 
of as the "learning" part of the implicit computation. 

We have shown that as a quantum system X changes 
state, the non-predictive quantum information held by a 
correlated system S equals the lost work potential from 
S. In particular, if X changes to a classical state which 
minimizes the lost work, this lost work is given by the 
quantum discord present before the change. More gener- 
ally, quantum discord quantifies the contribution to the 
lost work coming from quantum correlations. We have 
demonstrated that such correlations can contribute pos- 
itively to the thermodynamic operation. 

The author would like to thank Howard Carmichael, 
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and Susanna Still for helpful discussions. 



* Electronic address: |arne.grimsmo@ ntnu.no| 
[1] S. Still, D. A. Sivak,"A. J. BeU, and G. E. Crooks, Phys. 

Rev. Lett. 109, 120604 (2012). 
[2] S. StiU, J. R Crutchfield, and C. J. Ellison, Chaos: An 

Interdisciplinary Journal of Nonlinear Science 20, 037111 

(2010) . 

[3] J. S. BeU, Physics (Long Island City, N.Y.) 1, 195 (1964). 
[4] H. QUivier and W. H. Zurek, Phys. Rev. Lett. 88, 017901 
(2001). 

[5] L. d. Rio, J. Aberg, R. Renner, O. Dahlsten, and V. Ve- 

dral. Nature 474, 61 (2011). 
[6] R. Landauer, IBM journal of research and development 

5, 183 (1961). 

[7] N. Tishby, F. Pereira, and W. Bialek, Proceedings of the 
37th Annual AUerton Conference pp. 368-377 (1999). 

[8] Being "kept intact" means that, assuming S, X, and the 
rest of the enivronment, say R, make up a total pure 
state, then the reduced state of X + i? is not affected by 
the work extraction. 

[9] D. Egloff, O. C. O. Dahlsten, R. Renner, and V. Vedral, 
arXiv: 1207.0434. 
[10] M. A. Nielsen and I. L. Chuang, Quantum Computation 
and Quantum Information ( Cambridge Series on Infor- 
mation and the Natural Sciences) (Cambridge University 
Press, 2004). 

[11] Interestingly, we note that Wiost as defined here can be 
negative due to entanglement generation if one allows 
global maps; for example, if psx is pure, H{S\X) is neg- 
ative if and only if the state is entangled; it follows that 
if the quantum channel maps a pure unentangled state 
onto a pure entangled state, Wiost < 0, and net work is 
gained. 

[12] W. H. Zurek, Phys. Rev. A 67, 012320 (2003). 

[13] K. Modi, T. Paterek, W. Son, V. Vedral, and 

M. Williamson, Phys. Rev. Lett. 104, 080501 (2010). 
[14] D. Cavalcanti, L. Aolita, S. Boixo, K. Modi, M. Piani, 

and A. Winter, Phys. Rev. A 83, 032324 (2011). 
[15] V. Madhok and A. Datta, Phys. Rev. A 83, 032323 

(2011) . 

[16] M. Gu, H. M. Chrzanowski, S. M. Assad, T. Symul, 
K. Modi, T. C. Ralph, V. Vedral, and P. K. Lam, Nature 
Physics 8, 671 (2012). 

[17] L. Henderson and V. Vedral, Journal of Physics A: Math- 
ematical and General 34, 6899 (2001). 

[18] A. Ferraro, L. Aolita, D. Cavalcanti, F. M. Cucchietti, 
and A. Acm, Phys. Rev. A 81, 052318 (2010). 

[19] A. Streltsov, H. Kampermann, and D. Brufi, Phys. Rev. 
Lett. 107, 170502 (2011). 

[20] F. Ciccarello and V. Giovannetti, Phys. Rev. A 85, 
010102 (2012). 

[21] V. Vedral, M. B. Plenio, M. A. Rippin, and P. L. Knight, 
Phys. Rev. Lett. 78, 2275 (1997). 

[22] S. StiU, EPL (Europhysics Letters) 85, 28005 (2009). 

[23] Write [H 5{X\S) = H{psx\\x) - H{^pk\k){k\) + 
II{ps), where II{p3x\\x) is a relative entropy and x = 
'}2Pk\k){k\ (g) is the state that minimizes the dis- 
cord. Define x' = Is ® £{x) = J2Pk\k){k\ (g) £{p^), and 
S{X'\S)^, ■.= Hip'sx\\x')-Hij:Pk\k){k\) + Hips).Note 
that relative entropy is monotonic under a completely 
positive trace preserving map: H {£ {p)\\£ (a)) < II{p\\a) 
[13]. It follows that S{X'\S)^, < S{X\S). From this the 



5 

claimed result is proved since 5{X'\S) < S{X'\S)^> by [24] H. J. Carmichael, Phys. Rev. Lett. 70, 2273 (1993). 
construction. 



